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[tmm i ] se^j#-^ i x-mztizr * ymnm, ttc 
\mr * smimctoi,*? i 1 u < \mm<or % m& 

U fro/3 - ^*y>ijgco3fi (fc\fcc>'/£/cB3' 

ft site?-, 

[If ) I2?ij#-^ 1 5 y^ffi?! ttc 

\mr ^ smzn&te^T l h l < B£MI©7 $ 
g&, X£> »AbL<Btttt]3ftfc7^ /»J?:W 

[3H#E3 ] efli## i vmznzr 5 -/hen. *fc 

X*k, *PAfcL<ttff*nS3hfcT5-/MEW*W 
*&L<BKB#&. 

fcMSit^^ S»^2*fcB3K*l5<D#£. 
[«*l5]SW/c(ifKWH7K ~>^>, F 
^03^, #>*yffl % tfctt Pha f f i 

[tS^j|7 ] is - ^D^v^jS^UAjrc^WAJi 
§L Zymomona s H f-fflB, Sacchar omy 
c e s «»g. ifctt Cand i d a Higgrfr If 

[0 00 1 ] CSHBOWJR) 
Wr4ft^»«:*«S**ArS»*Sra- Ft 

s^^A-rs^stcBB-r^o 

[0 0 0 2 ] 

[fi£*<Dl$aS] #P*-M F (carotenoid, ^Pf/^f 

ftX^i> (Pfander, H. , ed. ,Key to Carotenoids. Bas 
el, Birkhauser, 1987). J A FB, W9P^ 
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fc-SSM-f 1:3 ti, »*©WI4«*SBiS tir t»6 b© 

4S«fi8i*«fiStt, fk*i^, 28, 219-227, 199 

0)o 

[0003] tivfsj KB, XfPHk R 

SSSBfcJc^r^rtStiS. «fiJ(D-Y 77*uy A Fr* 
^c5(D^v^>r^;Ub'p y>M (ipp) B«ttftRl£ 
oc<fc 0 ^ ^;U7 u ibb'o y >|? (dmapp) &t^&3 
ft. s^tc. dmapp b. csoipp ffi£-f&c<t 

CCctO, C10 ©y^^l/t'oy >^ (GPP) V C15 (D7 7 

W^I/b'P'J >K (fpp), C20 ©y^^^y^^;ut* 
a«;>» cggpp) ic^^^cc; K^Sfc45 o^gtf 

ltc>< . ^?p^y^ h*wmm&&mmm. ggpp 

20 t>%. 2 »i=-(DGGPP*«ffl^Lr, ffinVijUf-Sj FT 
*Sfittfe<D7-r hx> (phytoene) ^ffiStlS. 
hx>B. ^ISSMtSJ£«:J:0, M^C. H«££*«A 
$n^C cbCCct 0, y ^ hy^X>(phytonuene ;7 

Fx>Ccr.mf^l ffl) . t- ^7P^> Ct-caroten 
e ; 1115^2 <i) , /«/PX^b> (neurosporene ; 
-S*S^3 <i) , U n^>(l V copene ; HM*S^4 ffl) 
CcSCJftStiS,. e6(c k ya^>B»ft5l£tcJ:0i8- 
^7P^> (i6-carotene) - tJU^l/ (a-caroten 
e) (Cgy^£ft£ 0 -ebt, ^nf>^a- ^7P^ 

30 ^c^KX^FS&i^ajlASn. -b'r^^>^> 
(zeaxanthin) . ;l^>f> (lutein) . 7X^W>f 
> (astaxanthin)«Ci'<Da^©*1f> F7 < Jl/**^SES 

[0004] ^n^y>r K<D£^fiR*ffl^aef<Dais 
B, i990^f-t«:Ao-c^iW(cji]gL/fc 0 JSfiE*rfc, 

^<CD*P^y Y Kfife^fltafiKF*. »MXt«E (epiphv 
tiplBB Erwinia ?S t"- v>^c iCDtl^^rte 

jfe^fiEHBai Rhodobacter , v >^ Svnechococ 
cus sp. strain PCC7942, ^7 b' Neurospora crassa ^ 

AFOLsmms, m&w mn mm, 4i, 337-346, 1 

996)) „ Ltc&-?X % ftm$titcm*®tin*s<<( F4 
3?>«:Btt«!tt£*}g»|ElftOI6 

^*#y 7^x>v>^ry>y©is^ -2. A 

50 Mlii-BecciS^Df-^-Y K4jS, fb^i^tt, 35, 
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60-68, 1997. tecfcO\ EjRmjg, JWisSJYZ.'S 

tfisVA Hfiggggl. 22- 32 , 1997) 0 
[0 0 0 5] (3- £P^>£/3- ?V 7*h*tf>^>£ 

/\4 Fu^ri/?— -if (/3 -carotene hydroxylase) £3 
-K-T€)ji£T (crtZ £/tBbhy) B. ttftftftttS 
Erwinia , Fl avobacteri um JgUgjM » j%ffi#ltt Aqrobacter 
ium aurantiacum , Alcali genes sp. strain PC-1, fll 

( Arabidopsis thai i ana ) fr6Btf#3 10 
*rtl>& (N. Misawa, Y. Satomi , K. Kondo, A. Yokoy 
ama, S. Kajiwara, T. Saito, T. Ohtani.W. Miki, St 
ructure and functional analysis of a marine bacter 
ial carotenoid biosynthesis qene cluster and astax 
an thin biosynthetic pathway proposed at the qene 1 
evel. 3. Bacteriol., 177, 6575-6584, 1995 , fcj: 
c>\ Z.Sun, E. Gantt, F. X. Cunninqham, Jr., Clonin 
q and functional analysis of the #-caortene hydro 
xylase of Arabidopsis thai i ana , J. Biol. Chem., 27 
1, 24349-24352, 1996 s fcAtf. L. Pasamontes, D. H 20 
uq, M. Tessier, H.-P.Hohmann, J. Schierle, A. P. 
G. A. M. van Loon, Isolation and characterization 
of the carotenoid biosynthesis qenes of Flavobacte 
rium sp. strainR1534, Gene, 185, 35-41, 1997) . C 

t\b<D&- #P*>'W Y a**>7--te'B. »*fiji 
x. T5^KiB9Ju^rj:<K#$hro^c. fctx 

«. Erwinia LMRi&M<DCrtZlZ 53-56 %<D|5]— (07 Z 
y MffiWZ M LX&<0. Ctlh (Omm t fij^ Arabidopsi 
s <D£- #P?->;W KO^^>7-tU 31-37 %(Dffl 

-Qrzsms&lZ^LX^tco 30 

[0 00 6] ijafsj KB. «3Ri 64 
6" #p^>" (fcix.ll 'J3^>, tf-^p^X 

OfflFHiS, ftaW^Pf -M KKJ:4«*» 

JgffcS, 67, 39-41, 1993), 40 
[0 00 7] CffflJ©«5) 

-42<bia«©ffitt*j#^&*sp>, cne><D5- #P?> 
t\ /3- i)u?^Q<o&- -f*y>J8(D3(4(fc<fctf/ 50 
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4fcB3' ffi)K:*«S«:a§AU /3-*'j7*h*tf>? 
[0 0 0 8] 

[f$liI£8?&1~&/t&O^I5] 7 >^ Synechocystis s 
p. strain PCC6803B. J l^WN&hhAL 3 tl 

fcPi— Ovl/UXbZ (T. Kaneko, S. Sato, H. Kotan 
i, A. Tanaka et al . , Sequence analysis of the qeno 
me of the unicellular cyanobacterium Synechocystis 

sp. strain PCC6803. II. Sequence determination o 
f the entire qenome and assiqnment of potential pr 
otein-codinq reqions. DNA Res. 3, 109-136, 1996) 

. —~fo. Synechocystis PCC6803B. #7 

7*1^>*>£ft£fa#><7)/3- *7P^>'W KP**>5 

-tf«e**w*3Brr*sa*. taratttt*©*g*. 

(0RF)Bl,tB$*l&rt>ofc. Synechocysti 
s PCC6803<£>£- #P?->/W FP + ^^-'fe'jtf£i 1 
B> K#0/3- #n*»W Ka^?— tf£B3Mfc< 

W#6B* Synechocystis PCC6803CD^V A±CC*£££ 
ftfc3,166 ffi|<D*>/**»*:3- Kl^£0RFCD$#> 
6. /3- *?u7->£/3- *'j:7'r*1t>*>£gT-te'7 

K-rsoRF sii i468^mffib. **w*^fi!E*rs 

[0 0 0 9] r^c^^. *«MCC < fc4aGTB 1 S2?iJ# 

L<&im2 titers smmmzmo, fr^e - a*; 

iejRJSttiWTS^'J^^F (/3 - iov^y^U Y 

$n^>7 ^ smm. ttcimr $ sws&nc*s^x 1 
sa&F-iiticc^ALrcti^iBBSi*. igi*©/3 

tc^Ai/rctii^s-ttS^i^oe - #p^>£/3 
- ^ g7>+tf>^>*feBcn4igt:-fe'7+if>5 L > 
z>ct&mt?z>i><Dx$>z>. 

[0 0 1 0] ±K<D#i£lcJ:«j. fcixfl 4>i^ifi£ 



5 

+ >>7^-fe'iie^2SA0r^, fBI^I^!fU^CO-SUp 

pression ^raHfcSWC-fi C chfr < . Cft6£>8S£ 

[ooii] mw^mmuwim 
*fctt»7 ^ ystse?ijccfci^r i & u < ^ii(DT io 

r* 0 , WttE*J## l tor 5 -/KBfll* 

SSicfelU CO*1f>h7 4A<0Wi*«6B, ±E 
a^*1t£fc*AUTCti*ISR3'tf, i*Si+(D/3 

[00 12]$MS(CMit, mz&s • 
*j^i/rc»s (S«urc»s) Jffa&WtLT 

a, Phaf f iaR8aftt"Ctt. /3 - *7P^>£j£ 

TcD^^rgifflscc^A-rntf J: 1^0 */c. it^fl - 30 

iou?y<< YQLSfSM&fr o rt E, crt B, c r t 
I, c r t YO^»3£fcB-^^gA-rS£^*^. 
**W«c*5^rH*U(r>iS±-c***li^ z 

ymomonas USUI. Saccharomyces 
Candid aKBS©»&«. ±ta<D*7P?- 
^ K^RHie^tBWLr^ftt**, crtEilf 

^c*<Dr*ti6oae^r^r*z»A'rsij!:*^* 40 

So 

[00 13] K&£fiSafrfr»S c r t 

E, crt B, crt I, c r t Ytt. £*0£>«*<Z>£ 

tct^mmmmE rwinia 
ttErwi nia uredovora), ftft Ifflffi 
(/cixllAg robacter ium a u rant 
iacum), Alcal igenes sp. str 
a i n PC - \mc&&tZ>b<DZmi<>ZCt&X' 

sawa et a 1. , J . Bacter iol. 1 50 
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72, 6704-6712. 1990. N. Misaw 
a et al. , J. Bacteriol. , 17 
7, 6575-6584, 1 9 9 5*&C!e«£*Vt(,> 

6 0 *(*»ccw^-rn«. c r t E^mmmmom^ 

#s|4E>7 * yiSSHH -30 2, c r t BttBflJS-Sf 
30)7$ ^S^S-^ 1-2 9 6, c r t I teieflJ8#5(P 
T^yiSS^l-4 9 2, c r t- YB:BWS# 6<P7 3 
-/»«#l-3 8 2 *yt«S?»J##2<Dl-3 8 6. r 

i <> l < \mmor ^ x*. #a 

ft**** *3-K*5«£**tt/B-rs 
[0014] *«M»e?-teJ: WJJBO^Scrt it£^ 

n*iu ton?? ')-frhmfcj-TJ¥<Dft$tvmm 

Shn^m «jtHQiaft^a-^(fc4jtit. ft 

[0015] <5>^Synechocystis sp. strain PCC68 
03(0^^ At#&>Synechocystis sp. strain PCC6S03 
( fcTF PCC6803) \t^mM\t<Dv>mX\ 1^3.6 to <D 

34, 786-792, 1996) 0 1996^02 /3&C\ Jlc^flt^ 

Wit or«*6*rrcc6803 y^A(D£ig*E?iJ#&S3 

tff^-^^raStl/c CT.Kaneko, S. Sato, H. Kotan 
i, A. Tanaka et al . , Sequence analysis of theqeno 
me of the unicellular cyanobacterium Synechocysti s 
sp. strain PCC6803. II. Sequence determination of 
the entire genome and assignment of potential pro 
tein-codinq reqions. DNA Res. 3, 109-136, 1996) 8 

m&vu, pcc6803 tctntzt^xKoysuismt. ft 

p : //ftp . kazusa . or . i p/pub/cyano/cyano . p . aa . 2 CC X. 7 

[0016] PCC6803(7)yy A±tCB. 3,166 <i(7)^> 
M-*K£3~KL5 5af£^H«« (0RF) tfiW&Stl 
ft. 3,166(0000^. Wft\<D&&+tmVMt£*Ltc<b 
©« l,742ffl (^f*055%) "C*0, ^OStttfie*^ 
iH"C*4 4>©tt i,402flr*^fc. Lfc^or, BKO© 

+ 2/7—^*3- K-TSORF sll 1468£i, C0*»©1 
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[0017] <?ym PCC6803CD|8- *7P*>'W KP 
+ ^7-tf«£^0Kf#>fl- ^7U^>^/3- £»J;/r 
+*>^>«Sr-b'7+1f>^>CCSffe'r*/3-*rn^ 
>>M Kd + 5'7-'tf&:a- K?&0RF sll 1468«\ * 
OtaSE9J©1f*R(T. Kaneko, S. Sato, H. Kotani, A. 

Tanaka et al . , DNA Res. 3, 109-136, 1996)CCfi^t> 

TOdw BM**5 Itffl^/CPCR ffitCcfc 0 . 

PCC6803 £>£fc£{*DN/\ Wtf^fcORF sll 146»DiBJiJ4* 
(£l/fc. 5'- TCC TCC AGC CTG TGC CAG GAG TCC G -3'5 

ACT CTA GAG CTA GGG CTT GTC AGA TG -3'C £ "CfS 
6n/cDNA KK-^XhoIAbal Vffiitfe. pBluescript II 
KS+ (Strataqene) OXhoI-Xbal BWiltJfAU 
TPCC6803 <D8- UU?>>\<4 FOt^7- -fe'S BPh & 

[0018] <5>?I PCC6803<D/3- tjxi9l/^4 Kn 

• jwttjeoors 6^ci¥L < saws. 
[0019] mfaomtfflkis&ifm&frfm 
«T«. »*bc>a4»^oMi-*aeT<z)»A-«?iffi 

©31 Ate J: «^<Dfc#<Z)*Ji& l>L^j&U. $#6§Jj CC 

^L^S (/cixtf. "Vectors for cloning genes", M 
ethods in Enzymology, 216, p. 469-631, 1992, Acade 
mic Press v teJ;c>\ "Other bacterial systems", Met 
hods in Enzymology, 204, p. 305-636, 1991, Academic 
Press MR) (C»DtWtt tl«<fcC». ffl^X.^^ 
«1 iA^n/c77^$ F^*Sli*J»tt3W> 

tbtom£®®mw<D%mcmbzttztf£i,> act 

Sambrook, 3., Fritsch.E. F. , Maniatis, T. , " 
Molecular cloninq -A laboratory manual." Cold Spri 
nq Harbor Laboratory Press, 1989, ItH&A 
56tfW3S3f , LIST OFCULTURES 10th Edition, 1996 

m) . 

[0 0 2 0 ] (DAUB 

BH^Tf crt E 4 crt B> crt I v crt Y <D*@A^ 

K. J. Sambrook, E. F. Fritsch, T. Maniatis, "Mol 
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ecular cloninq -A laboratory manual." Cold Sprinq 
HarborLaboratory Press, 1989 #HS) . AIiMr<D£f 

SuitiCD "Molecular cloninq -A laboratory manua 
l. n #JH) v tct&X, pUC ^pBluescript &M<D lac 

W'fSAlliBffl^* ^-pBluescript II KS f (Strataqe 
ne) £ffil>T, lac (D^O*- ^ -<Dfi^te cfcC/lacZ 
10 ©Jfflil?© l J-KXJl/-4SWSfla{C PCC6803CDORF sll 

1468 MBrf- (*«W©it(E?-*dto) *iJ:0'41(Dl 
an *#AU C<3MtfrP*rt 

[002 1] (2) Zymomonas mobilis 
Zymomonas mobilis "C», ^ft^SC-?- <Z>fd!CC_biecc> 4 
8© crtatGT-©«A*^S<tftS. x^y-iU^jgffl 
@§ Zymomonas mobilis ^CO^SlSTCDzgASii. ^ 

20 v Mmmic&mums&mmc iotf 5 c £#-c*. 

Zymomonas mobilis T'05^^jif£^<Z)^B 4 tctTilt 
Zymomonas mobilis ffl^ £ £ -pZA22£ffit>Ttf 5 C t 
( r Zymomonas SfflBCDfl^WllJ % 
0*ls(t^, 63, p. 1016-1018, 1989 . teefctf. 
N. Misawa, S. Yamano, H. Ikenaqa, Production of /3 
-carotene in Zymomonas mobilis and Aqrobacterium t 
umefaciens by introduction of the biosynthesis qen 
es from Erwinia uredovora. Appl. Environ. Microbio 
l. f 57, 1847-1849, 1991#M) o 

30 [ o o 2 2 ] ( 3) mm 

mmvte, *^0^«e^(Dft!jic±fscD4scDcrtae^ 

(D?SA#*i&ll£&5o SS Saccharomyces cerevisiae 
JH) o BSrO^jlG^OlSStt. PGK * GPD (GAP) 

3feaer- (*«wiBfis^*iJ:o f 4«o crater) 

(D^I^-fe y h£\ S. cerevisiae O^ZZ-^fct 

tlu. yrp x (i?s&&{*cdars ^jt«iiajS4-r* 

SSfflv;Uf-3t 4 -^^^-) . YEp « (BS©2iUffl 

yip s (»^a»je**^fc&i««ife&#aji*ffl 

^^3?-) ^O^^*-CcJ*Arsci«:J:0tf^ci 
50 K^SOteaxDite^X^j tej:^, s. y 
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amano, T. Ishii, M. Nakaqawa, H. Ikenaqa, N. Mi saw 
a, Metabolic enqineerinq for production of /3-car 
otene and lycopene in Saccharomyces cerevisiae. B 
iosci. Biotech. Biochem., 58, 1112-1114, 1994 # 
ft?.) . 

[0023] Ifg Candida u til is ^©^*jt(£T-©* 

m (isff. mm, i#gii¥s-i73i7o*i&«) tcft-, 

G4l8lfttilfc?. *4<,»tt/Wyn-7-r"S/>BJttSI 10 
*filW#CcUc&. mm^-^xSfcu < BU 

'zmamztitccAP, pgk. r*\ &t*©7"p*-$-?r 

[0 024] Pg Phaffia rhodozyma ^©?|-3fejt£+- 

J: CM «© crtl£+) ©iA'SKo 
1>"CB. Van Ooven ^3tl/c^3?£ (Van Oo 

yen etal . , Transformation of Phaffia rhodozyma, W09 20 
4/06918, 1994) G418ltffil£i 1 & £ "©SIR V 

[0025] a&fta>6©aP3VY Kejp;©?lffl • ffi 
Kg 

B. r-feh>. ^po-t^Afeio'cne 
iSJ8&. it¥te£Mi>")*]y>\' (cos fji 

[002 6] tt«©E1HSftfei:ffafc?ft3B 
W»©*£ttaj|tfl - *o*>*j^ort>*©-c. * 
HWiUc^ro^SriSA-f tiB<fct>. H5a>©KC6803 ©$ 40 
2£$/3- *D?>M-f FP + J^—tfjgfi-?- (ORF sll 
1468) ^t?77X$ FfcfBHU Chfchvh. ^> 
h^tani/. *^3ttf©attJtt 
»»«ci»Al/. M3«Ci(ai -te'7 
■^(3 - f V7 , h*^>5 : ->teJ:0'cn6©t/P7 i ^^ K 

bs^p^/w KSfl*it-?>ofco, *7P*w f©« 



$$B!¥ 1 1 -4677 0 
10 

[002 7 ] «Ttt. W^©»3MG*©»A • &m 

tf^7^ 5 K©ffi«ffi. WBl (MB) ^<D7vZ 5 F© 
jfAfc-JrOWS ©tea>©#JH«ci,>l/*ffi«. 
*^tctet,»rTI2t/c.ic5W?(-©fc©{ctei,»r4>. 

a»©ae-f i*©»wcc «t o ami s *vc (, > s <>©*£ 

if, IffllSI^WWiffAPI, 1992 #JH) JcW 

Driorf ttB«ti>. 

[oo28] wto^m*m&rf<m*&it. mmmm 

IffiM Aqrobacteri un tumefaciens ; §:j'r"3"5#i£. xU? 

swai6h-a>*. 3iAi/fcc»fit«j©««(cic»i;rcne. 

©#£&<£t,>#tt&C<!:#r£&#. igJi-Ca. Aqroba 
cterium tumefaciens Zfrt h-fi&lfi&^mZtlX l> 
Z. yp ^*iSI%3S *--C*l>* 
'J:7 7 7--t1f O^-ObX (CaM/) ©35S 7'P^-J* 
Or. a*©SB*a»«C^H-r5fc©fctt 
^Ct#tJ4. CaMY 35S :/n*-ar-£S/Uc><-f 
— i*i?-pBI121 B Clontech ILfcOA^T'^. 
Aqrobacterium tunef aci ens %ft't~ : b tc#)<D'<i> £ — £ 
L/T, iiXGfrbtlTt,* 5 *>©-(: #$,, Erwinia ©crtl 
&£©«»©* P?W K££JS»£*tt. PBI121 £ 

tt^icjstA-ct. cne©jte-7-*^^'=i©^ h v h 

(HiRKig, *P^y>f K*^BHI*S«filftttM!»©flP 
HJ, ffi^©[t^llSS5 , 31, 143-149, 1996). ftfc. C© 
BS. ««5*ffliaHrt-C^fiS3nfcCrt ^^^S*. t>U 
T-s-i K4«©»-C*SSUik»^>a3R(*& i'©7'5 X ^ 
KKi^'ff^-ti^©^. h^>^^ KSe?'J (Wlx. 

BRubisco ©/Js-^yax h© h ? h^^'? 1 Kffi 
?|J) ^crtjtfi^ (CC-Ctt*»W«&P) ©HHtSi K 
>©iW»K:ff4-rs*R**a *nfy-{ 
K*^«K«W«aatt»»©fttH, «*©{t!**MBS. 3 
1, 143-149, 1996). 

[00 29] l2?iJ#^3-6(CWri.jie^ i &^^ 
ZcABiffiB. TK©J: 5Km&*R£*l¥I*8:fR 
if^EBf tc F E R M BP-23 7 7 i Lr^FIt^nrte 
*> . K?'J##2 tcB-T Sjie^?rffl*iiA/cA8i»B B 
P-4 5 0 5 it X mi -BtiX (,>•&. 

( 1 ) Escherichia coTi 3NQ09(pCARl) 
SliS# : PERM BP-2377 
3tK^B : ¥«MS 1 1 B 

(2 ) Escherichia coli JM101(pAccrt-EIB,pAK92) 
SKS-^ : PERM BP-4505 
Sft^flH :¥fi£5^12^2 0H 
Mt* ( 1 ) B. Erwinia uredovora © crtE(zexA), crt 
B(zexE). crtKzexD). crtY(zexC)^©ji^£^ : •?:§^r- 



11 

&9 , ( 2 ) B> Aqrobacterium aurantiacum (D c 
[0030] 

(epiphytic) frfflB Erwinia uredovora (Pert 
E, crtB, crtl, crtYjgfc^Wf & V'^Ts \ FpAGCARl 

6AcrtxB*JiSC<:£- # p *>££i&T&f&ft£^;L 10 
£C£#T'££ (N. Misavva et al. t Structure and fu 
nctional analysis of a marine bacterial carotenoid 
biosynthesis qene cluster and astaxanthin biosynt 
hetic pathway proposed at the qene level. J. Bacte 
riol., 177, 6575-6584, 1995) „ C<D7'5X$ FB, 

fcWurteO, Jr<ttfcnn»SpBiuescript -^puc m 

FCC6803 ys&CD?* ^ Y?4 :/^'J-<DF*9 ORF sll 1 
468 ( mc@- i3Vt?l/^4 Yn^i/v-MS&rf- , bh 20 
y ft/c) F£P-> cs 0827 (T. 

Kaneko et al . , DNA Res. 3, 109-136, 1996 ) *iKS 

<bur^i>r, WT<oi*if(DNA Zy'vjv-tbxpc 

5'- TCC TCG AGC CTG TGC CAG GAG TCC G -3' 
5*- ACT CTA GAG CTA GGG CTT GTC AGA TG -3' 
[003 1] CCTi#6ft/t0. 94 kb (DVUAffiftZm 
Pggf^Xhol/Xbal H CjH{bf&. pBluescript II KS+ (7 
>£>";>Itt4, Strataqeneft) CDXhoI-Xbal 3Sfi[tC# 
A-TSC&ICJ: 9 FpBS-bhy &S12?iJ 30 

<D##T^CJ:9, pBS-bhy Ba#j£T£0RF sll 1468 

*^ri>ac£*«BLfc. sc*j, c(D^7x^ Ft 

B. ORF sll 1468 B> ^^-pBluescript II KS+ 
Olac 7'n*-£-<7)S??<D , j~ KX;b-**»4*:W 
r&<. pBluescript II KS* OlacZ <DSilR<D V — F 

[0 03 2 ] (IBM) 2) ffl^x.A!lS^^*r&/7 
Pr^ F<D[§J^ 

20CDV*vX^ FpACCAR16AcrtX4dJ:c>pB5-bhy £WT 40 
£A8§ffi£> 150 Mfl/ml C7)T>t^>';>. 30 juq/ml 
(D^P7A7x^3-Jk 0.1 irMD-f 7 7'P t*Jl/- 1 
- ht^y^F (IPIG) SrStfLBtg 

tft(l % F';7'h>, 0.5% + l%NaC 

i) -c3o-c-c24«ih. s»jw*"ce«*ffofc. mm 

B#*>6#p*-M FSfR£7-fe F>-CMtfJU & 
^aD7^^-y^-il/ (9:1) rutt 

F ^'7 7 < - (hplc) ifcBWIl^a-? r i^77 4 - 50 



If B3¥ 1 1 -46 77 0 

12 

(JLQ (DV-yyJlstbtCo HPLCB. Nova-pak HR 6 u 
ftyU (300 x 3.9 mm, Waters) £fl?C\ 1 ml/minCD 

iiffit?, F^F lUU-j* ^y-;l/-2-7'UMV-Jlx 
(90:6:4) ■CJSBBSrffofc. TLC B, ^V*?^ (60F2 

54) £fflt,\ 2aa7 *)l&-JZS-)l> (15:1) "CR 

ra^rtfo/c 0 i3-^Df> (all-trans SS) BSigma f± 

+r>^> (all -trans M) % ^ 7* F 

(all -trans SS) , #>2*1J->*> (all-trans M) 

, (all-trans S)B, (N. 

Misawa et al . , J. Bacteriol . , 177, 6575-6584, 199 
5) GCfi£r>"C, Erwinia , £fcBErwinia lMftMM<Dc 

rt m&ttmmfrp>ttm£ ft -?tc 0 

[0 0 3 3 ] pACCAR16AcrtX^J:c>'pBS-bhy £ltt&A: 

HPLCteiO'TLC ^UfcjfSS, -fe'7+1f>^> (all-tr 
ans^) (£{*<D65%) % e-^U^F+^>^> (all-t 
rans 20 ( £{*<05 %) % /S- ^n^V (all-trans 

§o (^(*Q24%) comsmv&ztm&ztitc. & 

te, ff- ^af>W hS^aSA^ti/c^a^-y^ FT- 

&#>ofc 0 LWot, 7*^,X 5 FpBS-bhy(C^$n^0 
RF sll 1468 tt, /3- *7U^>£Sfl<!:bT, fi- * 

*1f>^><D3 , atC*»«^*AStl/c/7D^> r ^ FT 
*S427*1f>^>S:^fiS'rSB*j8- *D^>^W F 
n + ^> ^ - -fe' ( £ -carotene hydroxyl ase) FT 

-sae^-rAsci^fc^^fc. ^&t>%. orf sii 14 

68 B % <{*s>m<D 3tt (3'{4) CC7K^S^^A 
[0 0 3 4] C(DM^<It^W&CiC<!:t*^ 

K-r*ae^ ccrtz *fc«bh V ) b. »^affl«Er 

winia . Flavobacterium HJfflffi, iSffilfflgi Aqrobacteri 
um aurantiacum , Alcaliqenes sp. strain PC-1 , 
^>D^tXt (Arabidopsis thai i ana ) frttVtiCfR 
Vl$tVCIi*&& (N. Misawa, et. al . , 3. Bacteriol., 

177, 6575-6584, 1995, te<fcO\ Z. Sun, et. al . , J. 

Biol. Chem. , 271, 24349-24352, 1996 k fccfc^, L. 
Pasamontes, et al . , Gene, 185, 35-41, 1997), tft 
6<Di8- ^n^>^-/ Fn+^7-feU Suffix. 
he^J u^r* J: < Sff 5 nr z> c t tffrfr ~> 

rc>/c 0 tctX\t. Erwinia i*#fflffl(DCrtZ«53-56 

til^ Arabidopsis <D/3- *Pf>^-f Fa-tv^— 12 
«, 31-37 %<7>|5)-0T^ ^»iH9J*WLr^fc. - 
7>SS PCC6803 <D/8- FO*^9- 



(8) !$M¥ 1 1 -46 77 0 

14 

w?\ orf sn vmx, a- *n^>*-h9— fe'rttfc 
r * * c 4 u /cot* o /Co 

[0 0 3 5] 

EW»* : 1 
EJUOfiS : 9 39 

iBytf<Z>«S : Canonic DNA 



10 



*fe$9£ : Synechocystis sp. 
tfcg:PCC6803 

ffoD#S* : /3 -carotene hydroxylase /ctebhy 

<aef*> 



48 
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6, PCC6803 CD ORF slll468#n- Kf-S * 
ti. jftffiiffl gf Aqrobac teri um au ran ti acum fo Alcaliqen 
es sp. strain PC-1 CDCrtW -Nijg Hagnatococcus pi 
uvialis CDBKT <ht>ofc/3-#a3 1 ->^r r ^--t£ (/3- 
carotene ketolase) (S. Kajiwara, T. Kakizono, T. 
Saito, K. Kondo, T. Ohtani , N. Nishio, S. Naqai , 
N. Misawa, Isolation and functional identification 

of a novel cDNA for astaxanthin biosynthesis from 

Haeroatococcus pluvial is , and astaxanthin synthesi 
s in Escherichia coli . Plant Mol . Biol . , 29, 343- 

352, 1995) 4r5yBH3Wu^ur«««^=HRItt* 

WOrC^c. ZCX\ §tW%h&. ORFsll 1468**. 0 

em 

CTG TOC CAG GAG TCC CTC ATA GTA ATG CAG GCG ACC CAA CCG CTG CAA 
Met Cys Gin Glu Ser Val He Val Met Gin Ala Thr Gin Pro Leu Gin 

15 10 15 

ACC GTT TCC CAA GCT GTC CCA AAA GAG TTT TTA CAG GCG GAC GGC GGC 
Thr Val Ser Gin Ala Val Pro Lys Glu Phe Leu Gin Ala Asp Gly Gly 

20 25 30 

TTC AAT CCC AAC GTG GCC ATG TTC CGG ATA GCT ATT CTC TTA ATG CTC 
Phe Asn Pro Asn Val Ala Met Phe Gly He Ala He Leu Leu Met Leu 

35 40 45 

GCT AAC GTT TTT GGC TAC TGG CAA TGG GGG CTG CCC CAC TGG CTT TGT 
Ala Asn Val Phe Gly Tyr Trp Gin Trp Gly Leu Pro His Trp Leu Cys 

50 55 60 

TTT ACT TGT TCG CTG CTG GCG CTG CAC CTG TCA GCC ACA CTG ATC CAT 
Phe Ser Cys Ser Val Leu Ala Leu His Leu Ser Gly Thr Val He His 
65 70 75 80 

GAT GCA TCC CAC AAT GCG GCC CAT CGG AAC ACC ATT ATT AAT CCA GTG 
Asp Ala Ser His Asn Ala Ala His Arq Asn Thr He He Asn Ala Val 

85 90 95 

CTT GGC CAC GCT ACT GCC TTA ATG TTG GGC TTT GCT TTT CCC CTC TTT 
Leu Gly His Gly Ser Ala Leu Met Leu Gly Phe Ala Phe Pro Val Phe 

100 105 110 

ACC CGG GTT CAT CTC CAA CAC CAC GCC AAC CTC AAT GAC CCT CAA AAT 
Thr Arq Val His Leu Gin His His Ala Asn Val Asn Asp Pro Glu Asn 

115 120 125 

CAC CCA GAC CAT TTT CTT TCC ACC GGC GCT CCC CTC TTC CTC ATT GCC 
Asp Pro Asp His Phe Val Ser Thr Gly Gly Pro Leu Phe Leu He Ala 

130 135 140 

GCC CGG TTC TTC TAC CAT GAC ATC TTT TTC TTT AAA CGG CGG TTA TGG 
Ala Arq Phe Phe Tyr His Glu He Phe Phe Phe Lys Arq Arq Leu Trp 
145 150 155 160 

CGC AM TAT GAG CTA CTA GAG TGG TTC TTA ACT CGG CTT CTG TTG TTC 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



(9) ftPB¥ 1 1 -4 6 7 7 0 

15 16 
Arq Lys Tyr Glu Leu Leu Glu Trp Phe Leu Ser Arq Leu Val Leu Phe 

165 170 175 

ACC ATC CTT TTT CTC GGC ATT CAT TAC GGC TTT ATC GGC TTT CTC ATG 576 
TTir He Val Phe Leu Gly He His Tyr Gly Phe He Gly Phe Val Met 

180 185 190 

AAT TAC TGG TTT CTG CCT GCT TTA ATT GTT GGC ATT GCC CTG GGA CTG 624 
Asn Tyr Trp Phe Val Pro Ala Leu He Val Gly He Ala Leu Gly Leu 

195 200 205 

TTT TTT GAT TAC CTG CCC CAT CGA CCT TTC CAA GAA CCC AAC CGT TGG 672 
Phe Phe Asp Tyr Leu Pro His Arq Pro Phe Gin Glu Arq Asn Arq Trp 

210 215 220 

AAA AAT GCC AGG GTT TAT CCC AGC CCC ATT TTA AAT TGG CTC ATT TTC 720 
Lys Asn Ala Arq Val Tyr Pro Ser Pro He Leu Asn Trp Leu He Phe 
225 230 235 240 

GGG CAA AAT TAC CAC CTG ATC CAC CAC CTT TGG CCT TCT ATT CCT TGG 768 
Gly Gin Asn Tyr His Leu He His His Leu Trp Pro Ser He Pro Trp 

245 250 255 

TAT CAC TAC CAA AAC ACC TAT CAC ATC ACC AAG CCC ATT TTC GAT CAG 816 
Tyr Gin Tyr Gin Asn Thr Tyr His He Thr Lys Pro He Leu Asp Glu 

260 265 270 

AAG GCT TOT GAT CAA TCC CTG GGA TTA CTG CM GGG AAA AAT TTC TGG 864 
Lys Gly Cys Asp Gin Ser Leu Gly Leu Leu Glu Gly Lys Asn Phe Trp 

275 280 285 

AGC TTC CTC TAT GAT CTT TTC CTT GCT ATT CGT TTT CAC GGC CAT AAT 912 
Ser Phe Leu Tyr Asp Val Phe Leu Gly He Arq Phe His Gly His Asn 

290 295 300 

AAT TCT CAA TCA TCT GAC AAG CCC TAG 939 
Asn Ser Gin Ser Ser Asp Lys Pro*** 
305 

[0 03 6 ] E5U##: 2 ' 30*it§^: 

§2^J<£>S3 : 1 1 6 1 *fe1J?£ : Aqrobacterium aurantiacum 

ewoss : «©» - n*n ew©»» : 

b # P V - : mm ffiOtWR : crtY iM&tt > 

K5»J<D«SI : Genomic DNA * 

CTG ACC CAT GAC CTG CTG CTG GCA GGG GCG GGC CTT GCC AAC GGG CTG 48 
Met Thr His Asp Val Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leu 

15 10 15 

ATC GCC CTG GCG CTG CGC GCG GCG CGG CCC CAC CTG CCC CTG CTG CTG 96 
He Ala Leu Ala Leu Arq Ala Ala Arq Pro Asp Leu Arq Val Leu Leu 

20 25 30 

CTG GAC CAT GCC GCA GGA CCG TCA GAC GGC CAC ACC TGG TCC TGC CAC 144 
Leu Asp His Ala Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His 

35 40 45 

CAC CCC GAC CTG TCG CCG GAC TGG CTG GCG CGG CTG AAG CCC CTG CGC 192 
Asp Pro Asp Leu Ser Pro Asp Trp Leu Ala Arq Leu Lys Pro Leu Arq 

50 55 60 

CGC GCC AAC TGG CCC GAC CAC GAG GTG CGC TTT CCC CGC CAT GCC CGG 240 
Arq Ala Asn Trp Pro Asp Gin Glu Val Arq Phe Pro Arq His Ala Arq 
65 70 75 80 



(10) «f|B¥ 1 1 -46 77 0 

17 18 
CGC CTG GCC ACC OCT TAC COG TCG CTG GAC OGG GCG GCG CTG GCG GAT 288 
Arq Leu Ala Thr Gly Tyr Gly Ser Leu Asp Cly Ala Ala Leu Ala Asp 

85 90 95 

GCG CTG GTC COG TCG GCC GCC GAG ATC CGC TGG GAC ACC GAC ATC GCC 336 
Ala Val Val Arq Ser Gly Ala Glu He Arq Trp Asp Ser Asp lie Ala 

100 105 110 

CTG CTG GAT GCG CAG GGG GCG ACG CTG TCC TGC GGC ACC CGG ATC GAG 384 
Leu Leu Asp Ala Gin Gly Ala Thr Leu Ser Cys Gly Thr Arq He Glu 

115 120 125 

GCG GGC GCG GTC CTG GAC GOG CGG GGC GCG CAG CCG TCG CGG CAT CTG 432 
Ala Gly Ala Val Leu Asp Gly Arq Gly Ala Gin Pro Ser Arq His Leu 

130 135 140 

ACC CTG OCT TTC CAG AAA TTC CTG OCT GTC GAG ATC GAG ACC GAC CGC 480 
Thr Val Gly Phe Gin Lys Phe Val Gly Val Glu He Glu Thr Asp Arq 
145 150 155 160 

CCC CAC GGC CTG CCC CGC CCG ATG ATC ATG GAC GCG ACC GTC ACC CAG 528 
Pro His Gly Val Pro Arq Pro Met He Met Asp Ala Thr Val Thr Gin 

165 170 175 

CAG GAC GGG TAC CGC TTC ATC TAT CTG CTG CCC TTC TCT CCG ACG CGC 576 
Gin Asp Gly Tyr Arq Phe He Tyr Leu Leu Pro Phe Ser Pro Thr Arq 

180 185 190 

ATC CTG ATC GAG GAC ACG CGC TAT TCC GAT GGC GGC GAT CTG GAC CAC 624 
He Leu He Glu Asp Thr Arq Tyr Ser Asp Gly Gly Asp Leu Asp Asp 

195 200 205 

GAC GCG CTG GCG GCG GCG TCC CAC GAC TAT GCC CGC CAG CAG GGC TGG 672 
Asp Ala Leu Ala Ala Ala Ser His Asp Tyr Ala Arq Gin Gin Gly Trp 

210 215 220 

ACC GGG GCC GAG CTC CGG CGC GAA CGC GGC ATC CTT CCC ATC GCG CTG 720 
Thr Gly Ala Glu Val Arq Arq Glu Arq Gly He Leu Pro He Ala Leu 
225 230 235 240 

GCC CAT GAT GCG GCG GGC TTC TGG GCC GAT CAC GCG GCG GGG CCT CTT 768 
Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala Ala Gly Pro^Val 

245 250 255 

CCC CTG GGA CTG CGC GCG GGG TTC TTT CAT CCG CTC ACC GGC TAT TCG 816 
Pro Val Gly Leu Arq Ala Gly Phe Phe His Pro Val Thr Gly Tyr Ser 

260 265 270 

CTG CCC TAT GCG GCA CAG CTG GCG GAC CTG CTG GCG GCT CTG TCC GGG 864 
Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 

275 280 285 

CCG CCC GGC ACC GAC GCG CTG CGC GGC GCC ATC CGC GAT TAC GCG ATC 912 
Pro Pro Gly Thr Asp Ala Leu Arq Gly Ala He Arq Asp Tyr Ala lie 

290 295 300 

GAC CGG GCG CGC CGC GAC CGC TTT CTG CGC CTT TTG AAC CGG ATG CTG 960 
Asp Arq Ala Arq Arq Asp Arq Phe Leu Arq Leu Leu Asn Arq Met Leu 
305 310 315 320 

TTC CGC GGC TGC GCG CCC GAC CGG CGC TAT ACC CTG CTG CAG CGG TTC 1008 
Phe Arq Gly Cys Ala Pro Asp Arq Arq Tyr Thr Leu Leu Gin Arq Phe 

325 330 335 

TAC CGC ATG CCC CAT GGA CTG ATC GAA CGG TTC TAT CCC GGC CGG CTG 1056 
Tyr Arq Met Pro His Gly Leu He Glu Arq Phe Tyr Ala Gly Arq Leu 



[0037] 



19 

340 

ACC CTG CCC GAT CAG 
Ser Val Ala Asp Gin 
355 

OT GGC ACG GCC ATC 
Leu Gly Thr Ala lie 

370 
AAC CCA TGA 
Asn Ala *** 
385 

ems# : 3 

8 9 1 



Genomic DMA 

ATG CCA 
Met Ala 
1 

OCA AAA 
Ala Lvs 

TGT GAC 
Cys Asp 

OCT CCC 
Pro Ala 
50 

ACG CCC 
Thr Arq 
65 

GCT TTT 
Ala Phe 

TTT GAT 
ftie Asp 

AGC CAA 
Ser Gin 

GTC GOC 
Val Gly 
130 
CTG GC 
Leu Asp 
145 

GCT CGC 
Ala Arq 



(11) 

345 350 
CTG CGC ATC CTG ACC GGC AAG CCT CCC ATT CCC 
Leu Arq He Val Thr Gly Lys Pro Pro lie Pro 

360 365 
CGC TGC CTG CCC GAA CCT CCC CTG CTG AAG CAA 
Arq Cys Leu Pro Glu Arq Pro Leu Leu Lys Glu 
375 380 



: Erwinia uredovora 
ttQflHR: crtB 



1»W 1 1 -46 7 7 0 
20 

1104 



GTT GGC TCG 
Val Gly Ser 
5 

ACC CGG CGC 
Thr Arq Arq 
20 

GAT GTT ATT 
Asp Val He 
35 

TTA CAA ACG 
Leu Gin Thr 

CAG GCC TAT 
Gin Ala Tyr 

CAG GAA CTG 
Gin Glu Val 
85 

CAT CTG GAA 
His Leu Glu 

100 
CTG GAT GAT 
Leu Asp Asp 
115 

TTG ATG ATG 
Leu Met Met 

CGC GCC TGT 
Arq Ala Cys 



AAA AGT TTT GCG 
Lys Ser Phe Ala 



GAT ATT GTG 
Asp He Val 
165 

COX AGC TGG CTG GAG 



AGC GTA 
Ser Val 

GAC GAT 
Asp Asp 

CCC GAA 
Pro Glu 
55 

CCA GCA 
Ala Gly 

70 
GCT ATG 
Ala Met 

GGC TTC 
Gly Phe 

ACG CTG 
Thr Leu 

GCG CAA 
Ala Gin 
135 
GAC CTT 
Asp Leu 
150 

CAC GAT 
Asp Asp 



CTG ATG 
Leu Met 
25 

CAG ACG 
Gin Thr 

40 
CAA COT 
Gin Arq 

TCG CAG 
Ser Gin 

GCT CAT 
Ala His 

GCC ATG 
Ala Met 
105 
CGC TAT 
Arq Tyr 
120 

ATC ATG 
He Met 



ACA GCC 
Thr Ala 
10 
CTC TAC 
Leu Tyr 

CTG GGC 
Leu Gly 

CTG ATG 
Leu Met 

ATG CAC 
Met His 
75 

GAT ATC 
Asp He 

90 
GAT GTA 
Asp Val 

TGC TAT 
Cys Tyr 

GGC GTG 
Gly Val 



CGC CTG GCA TTT 
Gly Leu Ala Phe 
155 

GCG CAT GCG GGC 
Ala His Ala Gly 
170 

CAT GAA GCT CTG AAC AAA 



TCA AAG TTA 
Ser Lys Leu 

GCC TGG TGC 
Ala Trp Cys 
30 

TTT CAG GCC 
Phe Gin Ala 
45 

CAA CTT GAG 
Gin Leu Glu 
60 

GAA CCG GCG 
Glu Pro Ala 

GCC CCG GCT 
Ala Pro Ala 

CGC GAA GCG 
Arq Glu Ala 
110 

CAC GTT GCA 
His Val Ala 

125 
CGG GAT AAC 
Arq Asp Asn 
140 

CAG TTG ACC 
Gin Leu Thr 



TTT GAT 
Phe Asp 
15 
CGC CAT 
Arq His 

CGG CAG 
Arq Gin 

ATG AAA 
Met Lys 

TTT GCG 
Phe Ala 
80 

TAC GCG 
Tyr Ala 

95 
CAA TAC 
Gin Tyr 

GGC GTT 
Gly Val 

GCC ACG 
Ala Thr 



AAT ATT 
Asn lie 
160 
CTG CCG 
Leu Pro 
175 

GAG AAT TAT GCG GCA 



1152 



1161 



CGC TGT TAT 
Arq Cys Tyr 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



(12) ftRPF 1 1 - 4 6 7 7 0 

21 22 
Ala Ser Trp Leu Glu His Glu Civ Leu Asn Lys Glu Asn Tyr Ala Ala 

180 185 190 

CCT GAA AAC CCT CAC OCG CTC AGC CGT ATC GCC CGT CGT TTG GTG CAC 624 
Pro Glu Asn Arq Gin Ala Leu Ser Arq lie Ala Arq Arq Leu Val Gin 

195 200 205 

GAA GCA GAA CCT TAC TAT TTG TCT GCC ACA GCC GGC CTG GCA GGG TTG 672 
Glu Ala Glu Pro Tyr Tyr Leu Ser Ala Thr Ala Gly Leu Ala Gly Leu 

210 215 220 

CCC CTG CGT TCC GCC TGG GCA ATC GCT ACG GCC AAG CAG GTT TAC CGG 720 
Pro Leu Arq Ser Ala Trp Ala lie Ala Thr Ala Lys Gin Val Tyr Arq 
225 230 235 240 

AAA ATA GGT GTC AAA GTT GAA CAG GCC GCT CAG CAA GCC TGG GAT CAG 768 
Lys He Gly Val Lys Val Glu Gin Ala Gly Gin Gin Ala Trp Asp Gin 

245 250 255 

CGG CAG TCA ACG ACC ACG CCC GAA AAA TTA ACG CTG CTG CTG GCC GCC 816 
Arq Gin Ser Thr Thr Thr Pro Glu Lys Leu Thr Leu Leu Leu Ala Ala 

260 265 270 

TCT GGT CAG GCC CTT ACT TCC CGG ATG CGG GCT CAT CCT CCC CGC CCT 864 
Ser Gly Gin Ala Leu Thr Ser Arq Met Arq Ala His Pro Pro Arq Pro 

275 280 285 

GCG CAT CTC TGG CAG CGC CCG CTC TAG 891 
Ala His Leu Trp Gin Arq Pro Leu * 
290 295 

[0038] e : 4 * mm • 

E9Ufl>fi-3 : 9 0 9 : Ervrinia uredovora 

E&iom : mom : Z#lft l»»JO»* : 

h#nt?- : mm fltoflHR : crtE «■£*■«) 

n&iomM : Genomic DMA * 

ATG ACG GTC TGC GCA AAA AAA CAC GTT CAT CTC ACT CGC GAT GCT GCG 48 
Met Thr Val Cys Ala Lys Lys His Val His Leu Thr Arq Asp Ala Ala 

1 5 10 15- 

GAG CAG TTA CTG GCT GAT ATT GAT CGA CGC CTT GAT CAG TTA TTG CCC 96 
Glu Gin Leu Leu Ala Asp He Asp Arq Arq Leu Asp Gin Leu Leu Pro 

20 25 30 

GTG GAG GGA GAA CGG GAT CTT GTG GGT GCC GCG ATG CGT GAA GCT GCG 144 
Val Glu Gly Glu Arq Asp Val Val Gly Ala Ala Met Arq Glu Gly Ala 

35 40 45 

CTG GCA CCG GGA AAA CGT ATT CGC CCC ATG TTG CTG TTG CTG ACC GCC 192 
Leu Ala Pro Gly Lys Arq lie Arq Pro Met Leu Leu Leu Leu Thr Ala 

50 55 60 

CGC GAT CTG GGT TGC GCT GTC AGC CAT GAC GGA TTA CTG GAT TTG GCC 240 
Arq Asp Leu Gly Cys Ala Val Ser His Asp Gly Leu Leu Asp Leu Ala 
65 70 75 80 

TCT GCG GTG GAA ATG GTC CAC GCG GCT TCG CTG ATC CTT GAC GAT ATG 288 
Cys Ala Val Glu Met Val His Ala Ala Ser Leu He Leu Asp Asp Met 

85 90 95 

CCC TGC ATG GAC GAT GCG AAG CTG CGG CGC GGA CGC CCT ACC ATT CAT 336 
Pro Cys Met Asp Asp Ala Lys Leu Arq Arq Gly Arq Pro Thr lie His 
100 105 110 



(13) fiWP 1 1 -4 6 7 7 0 

23 24 
TCT CAT TAC CGA GAG CAT CTG GCA ATA CTG GCG GCG GTT aC TTG CTG 384 
Ser His Tyr Gly Glu His Val Ala He Leu Ala Ala Val Ala Leu Leu 

115 120 125 

ACT AAA GCC TTT GGC GTA ATT GCC GAT GCA GAT GGC CTC ACG CCG CTG 432 
Ser Lys Ala Phe Gly Val He Ala Asp Ala Asp Gly Leu Thr Pro Leu 

130 135 140 

GCA AAA AAT CGG GCG GTT TCT GAA CTG TCA AAC GCC ATC GGC ATG CAA 480 
Ala Lys Asn Arq Ala Val Ser Glu Leu Ser Asn Ala He Gly Met Gin 
145 150 155 160 

GGA TTG GTT CAG GGT CAG TTC AAG GAT CTG TCT GAA GGG GAT AAG CCG 528 
Gly Leu Val Gin Gly Gin Phe Lys Asp Leu Ser Glu Gly Asp Lys Pro 

165 170 175 

GGC AGC GCT GAA GCT ATT TTG ATG ACG AAT CAC TTT AAA ACC AGC ACG 576 
Arq Ser Ala Glu Ala He Leu Met Thr Asn His Phe Lys Thr Ser Thr 

180 185 190 

CTG TTT TCT GCC TCC ATG CAG ATG GCC TCG ATT GTT GCG AAT GCC TCC 624 
Leu Phe Cys Ala Ser Met Gin Met Ala Ser lie Val Ala Asn Ala Ser 

195 200 205 

AGC GAA GCG CGT GAT TGC CTG CAT CGT TTT TCA CTT GAT CTT GGT CAG 672 
Ser Glu Ala Arq Asp Cys Leu His Arq Phe Ser Leu Asp Leu Gly Gin 

210 215 220 

GCA TTT CAA CTG CTG GAC GAT TTG ACC GAT GGC ATG ACC GAC ACC GGT 720 
Ala Phe Gin Leu Leu Asp Asp Leu Thr Asp Gly Met Thr Asp Thr Gly 
225 230 235 240 

AAG GAT AGC AAT CAG GAC GCC GGT AAA TCG ACG CTG CTC AAT CTG TTA 768 
Lys Asp Ser Asn Gin Asp Ala Gly Lys Ser Thr Leu Val Asn Leu Leu 

245 250 255 

GGC CCG AGG GCC GTT CM GAA CGT CTG AGA CAA CAT CTT CAG CTT GCC 816 
Gly Pro Arq Ala Val Glu Glu Arq Leu Arq Gin His Leu Gin Leu Ala 

260 265 270 

ACT GAG CAT CTC TCT GCG GCC TGC CAA CAC GCG CAC GCC ACT CAA CAT 864 
Ser Glu His Leu Ser Ala Ala Cys Gin His Gly His Ala Thr Gin. His 

275 280 285 

TTT ATT CAG GCC TGG TTT GAC AAA AAA CTC GCT GCC CTC ACT TAA 909 
Phe He Gin Ala Trp Phe Asp Lys Lys Leu Ala Ala Val Ser * 
290 295 300 
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ATG AAA CCA ACT ACG GTA ATT GGT GCA GGC TTC GCT GGC CTG GCA CTG 48 
Met Lys Pro Thr Thr Val He Gly Ala Gly Phe Gly Gly Leu Ala Leu 

1 5 10 15 

GCA ATT CGT CTA CAA GCT GCG GGG ATC CCC GTC TTA CTG CTT GAA CAA 96 
Ala He Arq Leu Gin Ala Ala Gly He Pro Val Leu Leu Leu Glu Gin 

20 25 30 

CGT GAT AAA CCC GGC GCT CGG GCT TAT GTC TAC GAG GAT CAG GGG TTT 144 
Arq Asp Lys Pro Gly Gly Arq Ala Tyr Val Tyr Glu Asp Gin Gly Phe 



25 

35 

ACC TTT CAT GCA 
Thr Phe Asp Ala 
50 

CAA aC TTT CCA 
Clu Leu Phe Ala 
65 

CTC CCC GTT ACG 
Leu Pro Val Thr 



TTT AAT 
Fhe Asn 

TTT AAT 
Phe Asn 

CCC CCC 
Arq Ala 
130 
TTA TCC 
Leu Ser 
145 

CAC CCA 
Gin Ala 

CAA CAT 
Glu His 

AAT CCC 
Asn Pro 

CCT GAG 
Arq Glu 
210 
CAG GGG 
Gin Gly 
225 

AAC GCC 
Asn Ala 

GTG CAT 
Val His 

AAT GCA 
Asn Ala 

GCC GCC 
Ala Ala 
290 
TCT CTC 



TAC GAT 
Tyr Asp 
100 
CCC CGC 
Pro Arq 
115 

GTG TTT 
Val Phe 

TTC AGA 
Phe Arq 

TGG AGA 
Trp Arq 

CTC CGC 
Leu Arq 
180 
TTC CCC 
Phe Ala 
195 

TGG CCC 
Trp Gly 

ATG ATA 
Met lie 

AGA CTC 
Arq Val 

TTA CAG 
Leu Glu 
260 
GAT GTG 
Asp Val 
275 

GTT AAG 
Val Lys 



CGC CCG 
Gly Pro 

CTG GCA 
Leu Ala 
70 

CCG TTT 
Pro Phe 
85 
AAC GAT 
Asn Asp 

GAT CTC 
Asp Val 

AAA CAA 
Lys Glu 

GAC ATG 
Asp Met 
150 
AGC GTT 
Ser Val 
165 

CAG GCG 
Gin Ala 

ACC TCA 
Thr Ser 

CTC TGG 
Val Trp 

AAG CTG 
Lys Leu 
230 
AGC CAT 
Ser His 
245 

GAC GGT 
Asp Gly 



(14) 

40 

ACG GTT ATC ACC 
Thr Val lie Thr 
55 

GGA AAA CAC TTA 
Gly Lys Gin Leu 



TAC CCC 
Tyr Arq 

CAA ACC 
Gin Thr 

GAA GGT 
Glu Gly 
120 
CGC TAT 
Gly Tyr 
135 

CTT CGC 
Leu Arq 

TAC ACT 
Tyr Ser 

TTT TCT 
Phe Ser 

TCC ATT 
Ser He 
200 
TTT CCG 
Phe Pro 
215 

TTT CAG 
Phe Gin 

ATG GAA 
Met Glu 

CGC AGG 
Arq Arq 



CTC TCT 
Leu Cys 
90 

CGG CTC 
Arq Leu 
105 

TAT CCT 
Tyr Arq 

CTA AAG 
Leu Lys 

GCC GCA 
Ala Ala 

AAG GTT 
Lys Val 
170 
TTC CAC 
Phe His 
185 

TAT ACG 
Tyr Thr 

CCT GCC 
Arq Gly 

GAT CTC 
Asp Leu 

ACG ACA 
Thr Thr 
250 
TTC CTG 
Phe Leu 
265 

CGC CAC 
Arq Asp 



GTT CAT ACC TAT 
Val His Thr Tyr 
280 

CAG TCC AAC AAA CTG CAG 
Gin Ser Asn Lys Leu Gin 
295 

TTT GTG CTC TAT TTT GGT TTC AAT 



GAT CCC 
Asp Pro 
60 

AAA GAC 
Lys Glu 
75 
TGG GAG 
Trp Glu 

CM GCG 
Glu Ala 

CAG TTT 
Gin Phe 

CTC CCT 
Leu Gly 
140 
CCT CM 
Pro Gin 
155 

GCC ACT 
Ala Ser 

TCG CTG 
Ser Leu 

TTC ATA 
Leu He 

GCC ACC 
Gly Thr 
220 
GGT CGC 
Gly Gly 
235 

GGA AAC 
Gly Asn 

ACG CM 
Thr Gin 

CTG TTA 
Leu Leu 



45 

ACT GCC ATT CM 
Ser Ala He Glu 



*SHJ¥ 1 1 -4 6 7 7 0 
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TAT CTC 
Tyr Val 

TCA GGG 
Ser Gly 

CAG ATT 
Gin He 
110 
CTG GAC 
Leu Asp 
125 

ACT CTC 
Thr Val 

CTG GCG 
Leu Ala 

TAC ATC 
Tyr He 

TTC GTG 
Leu Val 
190 
CAC GCG 
His Ala 
205 

GCC CCA 
Gly Ala 

CM CTC 
Glu Val 

MG ATT 
Lys He 

GCC CTC 
Ala Val 
270 
AGC CAG 
Ser Gin 
285 

CGC ATG 
Arq Met 



CM CTC 
Glu Leu 
80 

MG CTC 
Lys Val 
95 
CAG CAG 
Gin Gin 

TAT TCA 
Tyr Ser 

CCT TTT 
Pro Phe 

AM CTG 
Lys Leu 
160 
GM CAT 
Glu Asp 
175 

CGC GCC 
Gly Gly 

CTG CAG 
Leu Glu 

TTA. GTT 
Leu Val 

GTG TTA 
Val Leu 
240 
CM GCC 
Glu Ala 
255 

GCG TCA 
Ala Ser 

CAC CCT 
His Pro 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



ACT AAG CCC ATG ACT MC 912 
Thr Lys Arq Met Ser Asn 
300 

CAC CAT CAT CAT CAG CTC 960 
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27 28 
Ser Leu Phe Val Leu Tyr Phe Civ Leu Asn His His His Asp Gin Leu 
305 310 315 320 

CCC CAT CAC ACG GTT TGT TTC GGC CCG CGT TAC CGC GAG CTG ATT CAC 1008 
Ala His His Thr Val Cys Phe Glv Pro Arq Tyr Arq Glu Leu lie Asp 

325 330 335 

CAA ATT TTT AAT GAT CAT GGC CTC CCA GAG CAC TTC TCA CTT TAT CTG 1056 
Glu He Phe Asn His Asp Gly Leu Ala Glu Asp Phe Ser Leu Tyr Leu 

340 345 350 

CAC GCG CCC TGT CTC ACG GAT TCG TCA CTG GCG CO" GAA CGT TGC GGC 1104 
His Ala Pro Cys Val Thr Asp Ser Ser Leu Ala Pro Glu Gly Cys Gly 

355 360 365 

ACT TAC TAT CTG TTG GCG CCC CTG CCG CAT TTA CGC ACC GCG AAC CTC 1152 
Ser Tyr Tyr Val Leu Ala Pro Val Pro His Leu Gly Thr Ala Asn Leu 

370 375 380 

CAC TGC ACG CTT GAG GGC CCA AAA CTA CGC CAC CCT ATT TTT CCC TAC 1200 
Asp Trp Thr Val Glu Gly Pro Lys Leu Arq Asp Arq lie Phe Ala Tyr 
385 390 395 400 

CTT GAG CAG CAT TAC ATG CCT GGC TTA CGC ACT CAG CTG CTC ACG CAC 1248 
Leu Glu Gin His Tyr Met Pro Gly Leu Arq Ser Gin Leu Val Thr His 

405 410 415 

CGG ATG TTT ACG CCG TTT GAT TTT CGC GAC CAG CTT AAT CCC TAT CAT 1296 
Arq Met Phe Thr Pro Phe Asp Phe Arq Asp Gin Leu Asn Ala Tyr His 

420 425 430 

GGC TCA GCC TTT TCT CTG GAG CCC GTT CTT ACC CAG ACC CCC TGG TTT 1344 
Gly Ser Ala Phe Ser Val Glu Pro Val Leu Thr Gin Ser Ala Trp Phe 

435 440 445 

CGG CCG CAT AAC CGC GAT AAA ACC ATT ACT AAT CTC TAC CTG CTC GGC 1392 
Arq Pro His Asn Arq Asp Lys Thr He Thr Asn Leu Tyr Leu Val Gly 

450 455 460 

CCA GGC ACG CAT CCC GGC GCA GGC ATT CCT GGC CTC ATC CGC TCG GCA 1440 
Ala Gly Thr His Pro Gly Ala Gly He Pro Gly Val He Gly Ser Ala 
465 470 475 .480 

AAA CCG ACA CCA GCT TIG ATG CTG GAG GAT CTG ATT TGA 1479 
Lys Ala Thr Ala Gly Leu Met Leu Glu Asp Leu He * 
485 490 
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ATG CAA CCG CAT TAT CAT CTG ATT CTC CTG GGG GCT GGA CTC GCG AAT 48 
Met Gin Pro His Tyr Asp Leu He Leu Val Gly Ala Gly Leu Ala Asn 

15 10 15 

GGC CTT ATC GCC CTC CCT CTT CAG CAG CAG CAA CCT GAT ATG CCT ATT 96 
Glv Leu He Ala Leu Arq Leu Gin Gin Gin Gin Pro Asp Met Arq He 

20 25 30 

TTG CTT ATC GAC GCC CCA CCC CAG GCG CGC GGG AAT CAT ACG TGG TCA 144 
Leu Leu He Asp Ala Ala Pro Gin Ala Gly Gly Asn His Thr Trp Ser 
35 40 45 



29 

TIT CAC CAC GAT 
Phe His His Asp 
50 

CTC CTG CTT CAT 
Leu Val Val His 
65 

CGT CCT AAG CTG 
Arq Arq Lys Leu 



OCT GAG 
Ala Clu 

GCG CTC 
Ala Val 

CTT ATC 
Val lie 
130 
TCA GCA 
Ser Ala 
145 

TTG AGC 
Leu Ser 

GTC GAT 
Val Asp 

CCG ACC 
Pro Thr 

TTA GAT 
Leu Asp 
210 
CAG GGT 
Gin Gly 
225 

CCC ATT 
Pro He 

CTG GCC 
Leu Ala 

TAT TCA 
Tyr Ser 

CAT GTC 
Asp Val 
290 
CGC GAG 
Arq Glu 



CTT TTA 
Val Leu 
100 
GCA GAG 
Ala Glu 
115 

GGT GCC 
Gly Ala 

CTG AGC 
Leu Ser 

CAC CCG 
His Pro 

CAG CAA 
Gin Gin 
180 
AGA TTG 
Arq Leu 
195 

CCT GAA 
Pro Glu 

TGG CAG 
Trp Gin 

ACT CTG 
Thr Leu 

TCT ACT 
Cys Ser 
260 
CTG CCC 
Leu Pro 
275 

TTT ACG 
Phe Thr 

CGC TGG 
Arq Trp 



GAT TTG ACT 
Asp Leu Thr 
55 

CAC TGG CCC 
His Trp Pro 
70 

AAC AGC GGC 
Asn Ser Gly 
85 

CAG CGA CAG 
Gin Arq Gin 

CTT AAT GCG 
Val Asn Ala 

CGC GCG CTG 
Arq Ala Val 
135 

CTG GGC TTC 
Val Gly Phe 

150 
CAT OCT TTA 
His Gly Leu 
165 

AAT GGT TAT 
Asn Gly Tyr 

TTA ATT GAA 
Leu He Glu 

TGC GCC COG 
Cys Ala Arq 
215 

CTT CAG ACA 
Leu Gin Thr 

230 
TCG GGC AAT 
Ser Gly Asn 
245 

GCA TTA CCT 
Gly Leu Arq 

CTG GCG CTT 
Leu Ala Val 

TCG GCC TCA 
Ser Ala Ser 
295 

CAG CAG CAC 
Gin Gin Gin 



(16) 

GAG AGC CAA 
Glu Ser Cln 

CAC TAT CAG 
Asp Tyr Cln 

TAC TTT TCT 
Tyr Phe Cys 
90 

TTT CGC CCC 
Phe Gly Pro 

105 
GAA TCT CTT 
Glu Ser Val 
120 

ATT GAC GCG 
He Asp Gly 

CAG GCC TTT 
Cln Ala Phe 

TCG TCT CCC 
Ser Ser Pro 
170 

CCC TTC CTC 
Arq Phe Val 

185 
GAC ACG CAC 
Asp Thr His 
200 

CAA AAT ATT 
Gin Asn lie 

CTC CTG CGA 
Leu Leu Arq 

GCC GAC GCA 
Ala Asp Ala 
250 

GCC CCT CTG 
Ala Gly Leu 

265 
GCC CTG GCC 
Ala Val Ala 
280 

ATT CAC CAT 
He His His 

GGC TTT TTC 
Gly Phe Phe 



CAT CCT 
His Arq 
60 

CTA CGC 
Val Arq 

75 
ATT ACT 
He Thr 

CAC TTG 
His Leu 

CCG TTG 
Arq Leu 

CGC GGT 
Arq Gly 
140 
ATT GGC 
He Gly 
155 

ATT ATC 
He He 

TAC AGC 
Tyr Ser 

TAT ATT 
Tyr He 

TGC GAC 
Cys Asp 
220 
GAA GAA 
Glu Glu 
235 

TTC TGG 
Phe Trp 

TTC CAT 
Phe His 

GAC CGC 
Asp Arq 

GCC ATT 
Ala He 
300 
CGC ATG 
Arq Met 



TGC ATA CCT CCC 
Trp He Ala Pro 
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TTT CCC 
Phe Pro 

TCT CAC 
Ser Gin 

TGG ATC 
Trp Met 
110 
AAA AAG 
Lys Lys 
125 

TAT GCG 
Tyr Ala 

CAG CAA 
Cln Clu 

ATG GAT 
Met Asp 

CTG CCG 
Leu Pro 
190 
GAT AAT 
Asp Asn 
205 

TAT CCC 
Tyr Ala 

CAG GGC 
Cln Gly 

CAG CAG 
Gin Gin 

CCT ACC 
Pro Thr 
270 
CTG ACT 
Leu Ser 
285 

ACG CAT 
Thr His 



ACA CGC 
Thr Arq 
80 

CGT TTC 
Arq Phe 

95 
GAT ACC 
Asp Thr 

CCT CAG 
Gly Gin 

CCA AAT 
Ala Asn 

TCG CGA 
Trp Arq 
160 
CCC ACC 
Ala Thr 
175 

CTC TCG 
Leu Ser 

CCG ACA 
Ala Thr 

CCG CAA 
Ala. Gin 

GCC TTA 
Ala Leu 
240 
CGC CCC 
Arq Pro 
255 

ACC GGC 
Thr Civ 

OCA CTT 
Ala Leu 

ttt ax 

Phe Ala 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



CTC AAT CCC ATG 960 
Leu Asn Arq Met 
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305 310 
CTC TTT TTA GCC GGA CCC GCC GAT 
Leu Phe Leu Ala Gly Pro Ala Asp 
325 

TTT TAT GOT TTA COT CM GAT TTA 
Phe Tyr Gly Leu Pro Glu Asp Leu 
340 

CTC ACG CTG ACC GAT CGG CTA CGT 
Leu Thr Leu Thr Asp Arq Leu Arq 
355 360 
CCG GTA TTA GCA GCA TTG CAA GCC 
Pro Val Leu Ala Ala Leu Gin Ala 
370 375 

[0 04 1 ] 
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315 320 
TCA CGC TGG CGG GTT ATG CAG CGT 1008 
Ser Arq Trp Arq Val Met Gin Arq 

330 335 
ATT GCC CGT TTT TAT GCG GGA AAA 1056 
He Ala Arq Phe Tyr Ala Gly Lys 
345 350 
ATT CTG ACC GGC AAG CCG COT GTT 1104 
lie Leu Ser Gly Lys Pro Pro Val 
365 

ATT ATG ACG ACT CAT CGT TAA 1149 
He Met Thr Thr His Arq * 
380 
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